Learning Similarity Function for Pronunciation Variations

نویسندگان

  • Einat Naaman
  • Yossi Adi
  • Joseph Keshet
چکیده

A significant source of errors in Automatic Speech Recognition (ASR) systems is due to pronunciation variations which occur in spontaneous and conversational speech. Usually ASR systems use a finite lexicon that provides one or more pronunciations for each word. In this paper, we focus on learning a similarity function between two pronunciations. The pronunciation can be the canonical and the surface pronunciations of the same word or it can be two surface pronunciations of different words. This task generalizes problems such as lexical access (the problem of learning the mapping between words and their possible pronunciations), and defining word neighborhoods. It can also be used to dynamically increase the size of the pronunciation lexicon, or in predicting ASR errors. We propose two methods, which are based on recurrent neural networks, to learn the similarity function. The first is based on binary classification, and the second is based on learning the ranking of the pronunciations. We demonstrate the efficiency of our approach on the task of lexical access using a subset from the Switchboard conversational speech corpus. Results suggest that our method is superior to previous methods which are based on graphical Bayesian methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Similarity Functions for Pronunciation Variations

A significant source of errors in Automatic Speech Recognition (ASR) systems is due to pronunciation variations which occur in spontaneous and conversational speech. Usually ASR systems use a finite lexicon that provides one or more pronunciations for each word. In this paper, we focus on learning a similarity function between two pronunciations. The pronunciations can be the canonical and the ...

متن کامل

Pronunciation Barriers and Computer Assisted Language Learning (CALL): Coping the Demands of 21st Century in Second Language Learning Classroom in Pakistan

Pronunciation of English language is a very important sub-skill of speaking module in second language learning process. However, it is ignored, neglected, and even never gotten least attention by the teachers, administrators, and stakeholders especially in Pakistan. Grammar, vocabulary, and the other linguistic skills such as reading and writing are emphasized whereas pronunciation has never be...

متن کامل

EFL Pronunciation Teaching: A Theoretical Review

This study aims to represent the developing status of pronunciation teaching and presents the current perspectives on pronunciation learning and teaching, coupled with innovative approaches and techniques/activities. It is argued that pronunciation teaching methodologies have changed over decades since the Reform Movement. The exact status of teaching pronunciation appeared first in the Audio L...

متن کامل

The Impact of Computer–Assisted Language Learning (CALL) /Web-Based Instruction on Improving EFL Learners’ Pronunciation Ability

The purpose of this study was to investigate the effect of CALL/Web-based instruction on improving EFL learners’ pronunciation ability. To this end, 85 students who were enrolled in a language institute in Rasht were selected as subjects. These students were given the Oxford Placement Test in order to validate their proficiency levels. They were then divided into two groups of 30 and were...

متن کامل

Unsupervised Pronunciation Adaptation for Off-line Transcription of Japanese Lecture Speeches

Observing that most variations in pronunciation are strongly speaker and speaking style dependent, and that the introduction of pronunciation variants in a speaker-independent recognition system is of limited success, we refrain from applying multiple pronunciation variants in the speakerindependent case and instead introduce pronunciation variants without supervision when specializing the reco...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1703.09817  شماره 

صفحات  -

تاریخ انتشار 2017